PISCES: a protein sequence culling server

نویسندگان

  • Guoli Wang
  • Roland L. Dunbrack
چکیده

PISCES is a public server for culling sets of protein sequences from the Protein Data Bank (PDB) by sequence identity and structural quality criteria. PISCES can provide lists culled from the entire PDB or from lists of PDB entries or chains provided by the user. The sequence identities are obtained from PSI-BLAST alignments with position-specific substitution matrices derived from the non-redundant protein sequence database. PISCES therefore provides better lists than servers that use BLAST, which is unable to identify many relationships below 40% sequence identity and often overestimates sequence identity by aligning only well-conserved fragments. PDB sequences are updated weekly. PISCES can also cull non-PDB sequences provided by the user as a list of GenBank identifiers, a FASTA format file, or BLAST/PSI-BLAST output.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PISCES: recent improvements to a PDB sequence culling server

PISCES is a database server for producing lists of sequences from the Protein Data Bank (PDB) using a number of entry- and chain-specific criteria and mutual sequence identity. Our goal in culling the PDB is to provide the longest list possible of the highest resolution structures that fulfill the sequence identity and structural quality cut-offs. The new PISCES server uses a combination of PSI...

متن کامل

Extraction of Motif Patterns from Protein Sequences Using SVD with Rough K-Means Algorithm

Discovering protein sequence motif information is one of the most crucial tasks in bioinformatics research. In this work, we try to obtain protein recurring patterns which are universally conserved across protein family boundaries. In order to generate higher quality protein sequence motif information from Protein Sequence Culling Server (PISCES) dataset, we tried several different advanced clu...

متن کامل

Analysis of vitellogenin gene structure in Caspian roach, Rutilus caspicus (Pisces: Cyprinidae) during exposure to Atrazine

Chemical contamination of aquatic environments to EDCs has become a major focus of environmental toxicology research. The exposure of fishes to estrogenic EDCs in aquatic environments is most frequently assessed by analyzing Vitellogenin (Vg) (the egg yolk precursor protein) expression. Therefore, characterization of Vg gene is of high priority for EDCs bio-monitoring. So, we prepared liver tis...

متن کامل

Sann: solvent accessibility prediction of proteins by nearest neighbor method.

We present a method to predict the solvent accessibility of proteins which is based on a nearest neighbor method applied to the sequence profiles. Using the method, continuous real-value prediction as well as two-state and three-state discrete predictions can be obtained. The method utilizes the z-score value of the distance measure in the feature vector space to estimate the relative contribut...

متن کامل

Tissue-specific transcriptional activity of a pancreatic islet cell-specific enhancer sequence/Pax6-binding site determined in normal adult tissues in vivo using transgenic mice.

A pancreatic islet cell-specific enhancer sequence (PISCES) shared by the rat insulin-I, glucagon, and somatostatin genes binds the paired domain-containing transcription factor Pax6 and confers strong transcriptional activity in pancreatic islet cell lines. It was found recently that Pax6 plays a major role in islet development. In the present study, transgenic mice were used to investigate PI...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 19 12  شماره 

صفحات  -

تاریخ انتشار 2003